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Abstract: Discrete version of state transition algorithm is proposed in order to solve the traveling salesman problem. Three special 
operators for discrete optimization problem named swap, shift and symmetry transformations are presented. Convergence analysis 
and time complexity of the algorithm are also considered. To make the algorithm simple and efficient, no parameter adjusting is 
suggested in current version. Experiments are carried out to test the performance of the strategy, and comparisons with simulated 
annealing and ant colony optimization have demonstrated the effectiveness of the proposed algorithm. The results also show that the 
discrete state transition algorithm consumes much less time and has better search ability than its counterparts, which indicates that 
state transition algorithm is with strong adaptability. 
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1 Introduction 

As one of the most significant combinatorial optimiza- 
tion problems, traveling salesman problem (TSP) has been 
paid greatly attention and been extensively studied, due to 
its importance in manufacturing, distribution management and 
scheduling!]! 0], Traveling salesman problem can be de- 
scribed as: Given a set of n nodes and distances for each pair 
of nodes, find a roundtrip of minimal total length visiting each 
node exactly once, and according to whether the distance from 
node i to node j is the same as from node j to node i, the TSP 
can be symmetric or asymmetric yj. For a n nodes asymmet- 
ric TSP, there exists (ra-1)! possible solutions; while for the 
corresponding symmetric TSP, (n-l)!/2 solutions are possible. 
Because of its NP-hard property, emphasis has shifted from the 
aim of finding a global optimal solution to the goal of obtain- 
ing "good solutions" in reasonable time and establishing the 
"the degree of goodness"|4j|5[]. 

Taking the impractical exhaustive search for TSP into con- 
sideration, heuristic algorithms are introduced to speed up the 
process of finding a satisfactory solution, of which, simulated 
annealing (SA), tabu search (TS), genetic algorithm (GA), 
ant colony optimization (ACO), particle swarm optimization 
(PSO) have found widest applications in the the field 1 6 M 1011 . In 
terms of the concepts of state and state transition, a new heuris- 
tic search algorithm named state transition algorithm (STA) is 
proposed recently, and it exhibits good search performance in 
continuous space fill [Till . In the initial version of state tran- 
sition algorithm, a special transformation named general ele- 
mentary matrix was also proposed to solve the discrete opti- 
mization problem. In this paper, we will continue to develop 
its detailed forms to promote deep study of STA for solving 
traveling salesman problem. 

The paper is organized as follows: In section 2, we review 



the unified form of the state transition algorithm, and establish 
the discrete version of STA. Then detailed forms and imple- 
mentations of three special transformation operators are put 
forward in section 3, and we also give some strategies of pa- 
rameters setting. In the next section, some experiments are 
performed to test the performance of the proposed algorithm. 
Conclusion is derived in the end. 

2 State transition algorithm in discrete version 



As stated in 111 111 , a solution to a specific optimization prob- 
lem can be described as a state, the thought of optimization 
algorithms can be treated as state transition, and the process to 
update solutions will become a state transition process. With- 
out loss of generality, the unified form of state transition algo- 
rithm can be described as 



x k+ i = A k x k + B k u k 
Vk+i = f(xk+t) 



(1) 



YANG Chunhua's work is supported by the National Science Found for 
Distinguished Young Scholars of China (Grant No. 61025015), and ZHOU 
Xiaojun is supported by China Scholarship Council, who is the corresponding 
author of this paper. 



where x k £ Sft™ stands for a state, corresponding to a solution 
of a optimization problem; A k , B k 6 5ft™ x " are state transition 
matrixes, which are usually transformation operators; u k 6 5ft™ 
is the function with variables x k and history states; / is the cost 
function or evaluation function. 

As for traveling salesman problem, the cost / is usually ex- 
pressed as the function of a sequence, which corresponds to an 
ordered traveling route. That's to say, after a transformation 
using A k or B k , a new state x k +i should be a sequence too. 
Due to the particularity of TSP, only a state transition matrix is 
considered, avoiding the complexity of "adding" one sequence 
to another. 

According to the theory of Linear Algebra, we know that 
the special matrix must have only one position with value 1 in 
each column and each row. The special random matrix is called 
general elementary matrix, because it is derived from identity 
matrix and has the function of transforming a sequence into 
another one. For simplicity and specificity, G k is denoted in 
state transition algorithm for discrete optimization problem as 
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follows 

f x k+1 = G k x k 
{ Vk+i = f{xk+i) 

where x k = [xi k , x ik , ■ ■ ■ , x nk ] T , x ik e {1, 2, • • ■ , n} be a 
sequence, the G k is general elementary matrix with only po- 
sition value 1 in each column and each row. For instance, G k 
has the following styles llllll 
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If an initial sequence is [1,2,3,4,5], then after the above general 
elementary matrices, it will becomes [1,2,5,4,3], [4,2,3,5,1], 
[4,2,1,5,3], respectively. 

3 State transition operators and corresponding ad- 
justing strategies 



As can be seen from above, the general elementary ma- 
trix can have various types, and arbitrary types may have nei- 
ther validity nor efficiency to discrete optimization problem. 
To make the state transition process controllable and efficient, 
three special transformation operators for traveling salesman 
problem are defined. 
(1) Swap transformation 

x k+ i = Gl wap (m a )x k (3) 

where G s k wap G 5ft™ x ™ is called swap transformation matrix, 
m a is a constant called swap factor to control the maximum 
number of random positions to be exchanged, while the posi- 
tions are random. For example, if n — 5, m a = 3, the swap 
transformation matrix has the following styles 
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If an initial sequence is [1,2,3,4,5], then after the above 
swap transformation matrices, it will becomes [3,2,1,4,5], 
[1,3,2,4,5], [1,5,3,2,4], respectively. It can be found that the 
1th and 3th rows are exchanged, the 2th and 3th rows are 
exchanged, and the 2th, 4th, and 5th rows are interchanged, 
respectively, which indicates that the maximum number of 3 
positions are exchanged. 
(2) Shift transformation 

x k+ i = G s k hl,ft (m b )x k (4) 

where G^ 1 ^ £ 5ft™ x ™ is called shift transformation matrix, 
rrifc is a constant called shift factor to control the maximum 
length of consecutive positions to be shifted. By the way, the 
selected position to be shifted after and positions to be shifted 
are chosen randomly. To make it more clearly, let n — 5, m b — 
2, then the shift transformation matrix has the following styles 
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If an initial sequence is [1,2,3,4,5], then after the above 
shift transformation matrices, it will becomes [1,3,2,4,5], 



[1,3,4,2,5], [1,4,2,3,5], respectively. In the first two cases, the 
position to be shifted is {2}, and the positions to be shifted 
after are {3}, {4}, while in the third case, the positions to be 
shifted is {2,3}, and the position to be shifted after are {4}. 
In the last case, {2,3} are the maximum length of consecutive 
positions to be shifted. 
(3) Symmetry transformation 

x k+ i = G s k ym (m c )x k (5) 

where G s k m € 5ft™ x " is called symmetry transformation ma- 
trix, m c is a constant called symmetry factor to control the 
maximum length of subsequent positions as center. By the 
way, the component before the subsequent positions and con- 
secutive positions to be symmetrized are both created ran- 
domly. For instance, n = 5,m c = 1, then the symmetry 
transformation matrix has the following styles 

/ 1 0\/l \ / 1 \ 

01000 00001 01000 

00010 00010 00001 

0100 0100 00010 

^ o o o o iy\o i o o oy\o o 1 o oy 

If an initial sequence is [1,2,3,4,5], then after the above sym- 
metry transformation matrices, it will becomes [1,2,4,3,5], 
[1,5,4,3,2], [1,2,5,4,3], respectively. All of the three cases, the 
component before the subsequent positions is {3}. In the first 
two cases, the subsequent position is {0}, while the consec- 
utive positions to be symmetrized are {4} and {4,5}, respec- 
tively. In the third case, the subsequent position(or the center) 
is {4}, while the consecutive position to be symmetrized is 
{5}. It is not difficult to find that the length of subsequent po- 
sitions is 1 in the last case, which is the maximum length of 
subsequent positions as indicated by m c . 

To accept a new solution, "greedy criterion" is commonly 
adopted, in other words, only solution better than previous one 
is accepted. However, in SA, a worse solution is accepted 
probabilistically. Although the strategy in SA can jump out of 
stagnant state and it creates a big perturbation to previous so- 
lution, to guarantee the convergence of SA becomes difficult. 
In current version of state transition algorithm for discrete op- 
timization problem, "greedy criterion" is inherited. The core 
procedure of the STA for TSP can be outlined in pseudocode 
as follows 
i: repeat 

2: [BestJBest] op_swap(cities,BestfBest,SE,n,m a ) 
3: [BestJBest] ^— opshift(cities,Best,fBest,SE,n,mb) 
4: [BestJBest] <~ op_symmetiy{cities,Best,fBest,SE,n,m c ) 
5: until the specified termination criterion is met 
where, cities is the information about the TSP; the SE is the 
times of transformation, called search enforcement. As for 
more detailed explanations, op_swap function of above pseu- 
docode is also given in MATLAB scripts 

for i=l:SE 

Tranf = swap_matrix (n, ma) ; 
State (i,:)= (Tranf *Best' )' ; 

end 

[newBest , f GBest ] = fitness (State, cities, SE) ; 
if fGBest < fBest 
Best = newBest; 
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fBest = fGBest; 

end 

function y = swap_matrix (n, ma) 

y = eye (n) ; 

R = randperm (n) ; 

T = R(l:ma) ; 

S = T (randperm (ma) ) ; 

y(T, :) = y(S, :); 

4 Theoretical analysis of discrete STA 

Next, we analyze the convergence performance, the global 
search ability, and time complexity of the discrete state transi- 
tion algorithm. 

4.1 Convergence properties of discrete STA 

Firstly, we define the concept of convergence for discrete 
STA 

|/(a*)-/(z*)|<e,Vfc>iV (6) 

while x* is the global minimum solution of a traveling sales- 
man problem, e is a small constant, and N is a natural num- 
ber. If e > 0, we can say that the algorithm converges to a 
e — optimal solution; if e = 0, we can say that the algorithm 
converges to a global minimum. However, if x* is a local min- 
imum solution, then we can say that the algorithm converges 
to a e — suboptimal solution and a local minimum for e > 
and e = 0, respectively. 

Theorem 1 the discrete STA can at least converge to a local 
minimum. 

Proof. Let suppose the maximum number of iterations (de- 
noted by M) is big enough, then there must exist a number 
N < M, when k > N, no update of the better solution will 
happen. That is to say, f(xk) = f(x best ),V k > N, where 
x best j s jjj e so i u tion in the Mh iteration. The x hest is just the 
local minimum solution as x*, and \f(x k ) - f(x best )\ = 0. 

4.2 Global search ability of discrete STA 

It is not difficult to find that whether the discrete STA con- 
verges to a global minimum depends on the x best in the Mh 
iteration. If the x best is the global minimum, then according to 
the "greedy criterion" we use to update the best in pseudocode, 
when k > N, best will always be x best . In other words, the 
global convergence performance has much to do with the three 
operators we have designed. 

Theorem 2 the discrete STA can converge to a global mini- 
mum in probability. 

Proof. Let suppose x* = (ai, ■ • ■ , a n ) is the global mini- 
mum solution, and x k = (£>i, • • • , b n ) is the £th best solution. 
Then we discuss separately that how the state Xk can transform 
to the best state x* . 

If there exists the same sub-sequence in both Xk and x*, for 
instance, x k = {a n ,a 2 , • • • , a„_i, ai), then both swap trans- 
formation with small swap factor and appropriate shift trans- 
formation have the probability to swap or shift other positions 
in the sequence so that other positions in Xk can transform ex- 
actly to the corresponding positions in the x*. 



If there exists no same sub-sequence in xu and x* , for in- 
stance, Xk = (a n -\, a„_2, • • • , a<2, then swap transforma- 
tion with big swap factor and proper symmetry transformation 
have the probability to swap or symmetrize all positions in the 
sequence so that all positions in Xk can transform exactly to 
the corresponding positions in the x* . 

4.3 Time complexity of discrete STA 

Because of the NP-hard property of TSP, it is impossible 
to solve the problem in polynomial time. As described in the 
section 1, to obtain a "good solution" in a reasonable time is a 
optional choice, and this is the same case as the discrete STA, 
which aims at getting a satisfactory solution in as short a time 
as possible. In the pseudocode as described above, we can find 
that in the out loop, there are M iterations, and in the inner 
loop, there exist three times of SE transformations. It is not 
difficult to find that the time complexity of the discrete STA 
is 0(M ■ SE), that is to say, the discrete STA can achieve a 
global optimum in polynomial time in probability. 

5 Experimental results and discussion 

From the MATLAB scripts in section 3, we can find that as 
the m a increases, the exchanging transformation matrix will 
become less efficient. The same phenomenon can be observed 
in other two transformations, and repetition of the transforma- 
tion matrices appears frequently. 

For simplicity and efficiency, in current version of STA for 
TSP, parameters for three transformation operators are made 
as smaller as possible, that is to say, m a = 2, nib = 1, m c = 
are specified in the paper, to avoid the trouble of parameters' 
adjusting. 

In order to test the performance of the proposed STA in 
discrete version, some benchmark symmetric traveling sales- 
man problems from [3] are utilized for experiment, including 
ulysses!6.tsp, att48.tsp and berlin52.tsp. In the same time, 
SA, ACO, which are recognized as distinguished algorithms 
for TSP are used for comparison with STA. In SA, we set ini- 
tial temperature at 5000, cooling rate at 0.97, and in ACO, 
a = = 5,p = 0.9, where, a, (3 are used to control the 
relative weight of pheromone trail and heuristic value, and p 
is the pheromone trail decay coefficient. In ACO and STA, 
the number of ants or the search enforcement is 20, and max- 
imum iterations is fixed at 200. Considering that SA is not 
usually population based algorithm, the maximum iteration is 
extended especially for fairness. The threshold, or the total 
number of iterations in SA is set at 4000. 

Programs are run independently for 20 trails for each al- 
gorithms in MATLAB R2010b (version of 7.11.0.584) on In- 
tel(R) Core(TM) i3-2310M CPU @2.10GHz under Window 
7 environment, and comparison results for STA with SA and 
ACO are listed in Table 1. Some statistics and the run time 
are utilized to evaluate the performance of algorithms. The 
best means the minimum of the results, the worst indicates the 
maximum, and then it follows the mean, st.dev. (standard de- 
viation). The run time is the average time used in 20 trails, 
which is measured in seconds. 

As can be seen from the Table, STA outperforms SA and 
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Table 1 : Results for benchmark test problems 

Problems Performance SA ACO STA 

best 73.9998 74.6287 73.9876 

mean 74.4481 76.0864 74.0779 

ulyssesl6.tsp worse 75.5391 78.7728 74.5939 

st.dev. 0.4105 1.1062 0.1626 

time(s) 2.9975 11.3038 1.2223 






best 


3.5266e4 


3.7015e4 


3.3724e4 




mean 


3.9667e4 


3.8449e4 


3.4872e4 


att48.tsp 


worse 


4.5887e4 


3.9801e4 


3.6205e4 




st.dev. 


2.7453e3 


862.4546 


668.7553 




time(s) 


14.7605 


102.4784 


3.0462 




best 


8.1864e3 


8.2404e3 


7.5444e3 




mean 


8.9838e3 


8.7776e3 


8.2472e3 


berlin52.tsp 


worse 


9.5858e3 


9.1513e3 


8.6305e3 




st.dev. 


380.1004 


267.1124 


273.4509 




time(s) 


139.8399 


118.0948 


3.3438 



ACO in almost every performance index. To be more spe- 
cific, the results of each benchmark problem are discussed sep- 
arately in the following. 

ulyssesl6.tsp: The best of the results is obtained by STA, 
with the sequence of (7 6 14 13 12 16 1 3 2 4 8 15 5 1 1 9 
10), which gets the best length of route at 73.9876, and the 
best route is plotted in Fig.l. To be more careful, we can find 
that the worst solution gained by STA is even better than the 
best of ACO, which indicates the strong search capability of 
STA. The st.dev. of STA is almost approaching zero, and it 
shows that STA is also stable for this test problem. By the way, 
the STA consumes the half time of SA, and 1/9 time of ACO. 
The curves of the average fitness are illustrated following in 
Fig. 2. The data created by SA are condensed to the same iter- 
ations(the same method applied to other two results obtained 
by SA). It is interesting to find that the ups and downs of the 
curve of SA in the early stage, because SA accepts a relatively 
worse solution by probability. Take the condensed data of SA 
into consideration, we can perceive that SA needs quite a long 
time for its steadily descending trend. However for ACO, af- 
ter quickly to each a good fitness, it is trapped into stagnation 
point. But for STA, neither of the phenomena occurs, and it 
keeps decreasing before not a short time. 

ulysses16.tsp 
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Fig. 1: the best route of ulyssesl6.tsp obtained by STA 



att48.tsp: STA also acheives the best, with the sequence of 
(9 40 15 12 1 1 23 3 22 16 41 34 48 5 29 2 42 26 4 35 45 10 
24 32 39 25 14 13 21 47 20 33 46 36 30 43 17 27 19 37 6 28 
7 18 44 31 38 8 1) and length of route at 3.3724e4, which is 
illustrated in Fig. 3. The same situation is observed in the re- 
sults between ACO and STA, that is, the worst solution gained 
by STA is better than the best of ACO. SA outperforms ACO 
in the best, but mean and worst in ACO are better than that 
of SA. In this case, the st.dev. of STA is not very satisfactory, 
although it is better than its counterparts. The time costed by 
STA is much shorter than other two algorithms, only around 
1/4 and 1/30 time of that of SA and ACO consume, respec- 
tively. From Fig. 4, we can find low degree of ups and downs in 
the curve of SA, and SA keeps good descending fitness in the 
first stage. ACO confronts the premature convergence again, 
few changes have happened in the late process. For STA, the 
fitness decrease sharply in the early stage, but can still keep de- 
creasing in the later, which indicates the excellent performance 
of the designed operators in the discrete STA. 



att48.tsp 

6000 r 




1000 2000 3000 4000 5000 6000 7000 8000 



Fig. 3: the best route of att48.tsp obtained by STA 

berlin52.tsp: The STA wins the best again, with the best 
sequence of (3 17 21 42 7 2 30 23 20 50 29 16 46 44 34 35 
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2 1 , 1 1 1 
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Iterations 

Fig. 4: curve of the average fitness for att48.tsp 



36 39 40 37 38 48 24 5 15 6 4 25 12 28 27 26 47 13 14 52 
11 51 33 43 10 9 8 41 19 45 32 49 1 22 31 18) and length 
of route at 7.5444e3, which can be observed in Fig. 5. At this 
time, ACO exhibits much better than SA in other performance 
except for the best. But, STA achieves the best results on the 
whole, especially for the computational time. For the problem, 
STA consumes respectively 1/40 and 1/35 of the time costed 
by SA and ACO. As for TSP, the time complexity is really 
important, so the results gained by STA shows again that the 
discrete STA is really promising. In Fig. 6, we can find that 
SA need quite a long time to reach a relatively good fitness, 
and then it becomes stagnated. On the contrary, the fitness of 
STA can decease quickly and keep descending till the end of 
the process. 



berlin52.tsp 




200 400 600 600 1000 1200 1400 1600 1600 



Fig. 5: the best route of berlin52.tsp obtained by STA 

6 Conclusion 

Different from continuous search space, the space for travel- 
ing salesman problem is discrete, which corresponds to a per- 
mutation of a sequence. In discrete version of state transition 
algorithm, three special state transition operators are designed 
to manipulate the permutation. A simplest but much efficient 



berlin52 




0.5 1 1 1 1 

50 100 150 200 

Iterations 

Fig. 6: curve of the average fitness for berlin52.tsp 

parameters control strategy is proposed, in which, no param- 
eters of the operators need adjusting. Some experiments are 
done to evaluate the proposed algorithm, and the results show 
that the discrete version of STA has much better performance 
not only in the search ability but also in the time consuming. 

Premature phenomenon is extensively existing in heuristic 
algorithms. To escape from a stagnation point, the paper fo- 
cuses on the designing of operators. Accepting a relatively 
worse solution is a good idea, as seen in SA; however, it in- 
creases the computing time and risks the non-convergence. It 
is really excited to see the fantastic performance of discrete 
STA only with "greedy criterion", due to the excellent oper- 
ators designed. On the other hand, difficulties are still to be 
confronted with large size problem, and the adjusting of the 
parameters need reconsidering. By the way, in current version 
of STA, three transformations are only for local permutation, 
and effective global permutation will be found in our future 
work as well as the equilibrium between them. 
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